Offline Quantum Reinforcement Learning in a Conservative Manner
نویسندگان
چکیده
Recently, to reap the quantum advantage, empowering reinforcement learning (RL) with computing has attracted much attention, which is dubbed as RL (QRL). However, current QRL algorithms employ an online scheme, i.e., policy that run on a computer needs interact environment collect experiences, could be expensive and dangerous for practical applications. In this paper, we aim solve problem in offline manner. To more specific, develop first (offline QRL) algorithm named CQ2L (Conservative Quantum Q-learning), learns from samples does not require any interaction environment. utilizes variational circuits (VQCs), are improved data re-uploading scaling parameters, represent Q-value functions of agents. suppress overestimation Q-values resulting data, double Q-learning framework reduce bias; then penalty term encourages generating conservative designed. We conduct abundant experiments demonstrate proposed method can successfully tasks counterpart not.
منابع مشابه
Offline Evaluation of Online Reinforcement Learning Algorithms
In many real-world reinforcement learning problems, we have access to an existing dataset and would like to use it to evaluate various learning approaches. Typically, one would prefer not to deploy a fixed policy, but rather an algorithm that learns to improve its behavior as it gains more experience. Therefore, we seek to evaluate how a proposed algorithm learns in our environment, meaning we ...
متن کاملReinforcement Learning in Neural Networks: A Survey
In recent years, researches on reinforcement learning (RL) have focused on bridging the gap between adaptive optimal control and bio-inspired learning techniques. Neural network reinforcement learning (NNRL) is among the most popular algorithms in the RL framework. The advantage of using neural networks enables the RL to search for optimal policies more efficiently in several real-life applicat...
متن کاملReinforcement Learning in Neural Networks: A Survey
In recent years, researches on reinforcement learning (RL) have focused on bridging the gap between adaptive optimal control and bio-inspired learning techniques. Neural network reinforcement learning (NNRL) is among the most popular algorithms in the RL framework. The advantage of using neural networks enables the RL to search for optimal policies more efficiently in several real-life applicat...
متن کاملGeneralized Quantum Reinforcement Learning with Quantum Technologies
We propose a protocol to perform generalized quantum reinforcement learning with quantum technologies. At variance with recent results on quantum reinforcement learning with superconducting circuits [L. Lamata, Sci. Rep. 7, 1609 (2017)], in our current protocol coherent feedback during the learning process is not required, enabling its implementation in a wide variety of quantum systems. We con...
متن کاملreinforcement learning in neural networks: a survey
in recent years, researches on reinforcement learning (rl) have focused on bridging the gap between adaptive optimal control and bio-inspired learning techniques. neural network reinforcement learning (nnrl) is among the most popular algorithms in the rl framework. the advantage of using neural networks enables the rl to search for optimal policies more efficiently in several real-life applicat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2023
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v37i6.25872